Application of Convolutional Neural Network for Image Classification on Pascal VOC Challenge 2012 dataset
نویسنده
چکیده
In this project we work on creating a model to classify images for the Pascal VOC Challenge 2012. We use convolutional neural networks trained on a single GPU instance provided by Amazon via their cloud service Amazon Web Services (AWS) to classify images in the Pascal VOC 2012 data set. We train multiple convolutional neural network models and finally settle on the best model which produced a validation accuracy of 85.6% and a testing accuracy of 85.24%. 1. Introduction Using convolutional neural networks (CNN) for image classification has become the de facto standard largely due to the success of Krizhevsky et al. [1], Szegedy et al. [2], Simonyan and Zisserman [3] and especially He et al. [4], which is now the stateofthe art architecture for image classification. By stacking convolutional layers on top of each other, amongst other architectural artifacts, highly accurate models for task of image classification, detection and segmentation have been discover. These models have been able to nearly match, if not exceed human performance on certain datasets. CNNs are not without their own shortcomings though. Due to the sheer size of networks and the millions of parameters to be optimized, the reliance of highperformance systems increases dramatically. Though modern GPUbased systems are capable of perfoming the intensive computations required by a convolutional neural network, implementing a CNN is unsuitable for those without access to such a high performance system. Also, more complex architectures such as those by [4] take up to 23 weeks to train which demonstrate the inherent difficulties in training deep convolutional neural networks. However, given the fact that CNNs provide the best results compared to other image classification models and also motivated by the credibility of the performance of CNNs by [1]–[4], we train multiple
منابع مشابه
Learning Document Image Features With SqueezeNet Convolutional Neural Network
The classification of various document images is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state of the art model for this task. However, there are two major drawbacks for these classifiers: the huge computational power demand for...
متن کاملA Radon-based Convolutional Neural Network for Medical Image Retrieval
Image classification and retrieval systems have gained more attention because of easier access to high-tech medical imaging. However, the lack of availability of large-scaled balanced labelled data in medicine is still a challenge. Simplicity, practicality, efficiency, and effectiveness are the main targets in medical domain. To achieve these goals, Radon transformation, which is a well-known t...
متن کاملGeneric Object Detection with Dense Neural Patterns and Regionlets
This paper addresses the challenge of establishing a bridge between deep convolutional neural networks and conventional object detection frameworks for accurate and efficient generic object detection. We introduce Dense Neural Patterns, short for DNPs, which are dense local features derived from discriminatively trained deep convolutional neural networks. DNPs can be easily plugged into convent...
متن کاملA Convolutional Neural Network based on Adaptive Pooling for Classification of Noisy Images
Convolutional neural network is one of the effective methods for classifying images that performs learning using convolutional, pooling and fully-connected layers. All kinds of noise disrupt the operation of this network. Noise images reduce classification accuracy and increase convolutional neural network training time. Noise is an unwanted signal that destroys the original signal. Noise chang...
متن کاملFully Connected Deep Structured Networks
Convolutional neural networks with many layers have recently been shown to achieve excellent results on many high-level tasks such as image classification, object detection and more recently also semantic segmentation. Particularly for semantic segmentation, a two-stage procedure is often employed. Hereby, convolutional networks are trained to provide good local pixel-wise features for the seco...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1607.03785 شماره
صفحات -
تاریخ انتشار 2016